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DNA RECOMBINATION IN EUKARYOUC CELLS BY THE 
BACTERIOPHAGE PHIC31 RECOMBINATION SYSTEM 



STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 

This invention was made with government support under Grant No. 5335- 
21000-009-06S, awarded by the United States Department of Agriculture, Agricultural 
Research Service. The Govemmait has certain rights in the invention. 

CROSS-REFERENCE TO RELATED APPLICATION 

This appUcation claims the benefit of US Provisional Application No. 
60/145,469. filed July 23, 1999, which application is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention pertains to the field of methods for obtaining specific and 
stable integration of nucleic acids into chromosomes of eukaryotes. The invention makes use 
of site-specific recombination systems that use prokaryotic recorabinase polypeptides, such 
astheOCSl mtegrase. 

Background 

Genetic transfonnation of eukaryotes often suffers from significant 
shortcomings. For example, it is often difficult to reprxKiucibly obtain integration of a 
transgene at a particular locus of interest. Homologous recombination generally occurs only 
at a very low fi:equency. To overcome this problem, site-specific recombination systems 
have been employed. These methods involve the use of site-specific recombination systems 
that can operate in higher eucaryotic cells. 

Many bactaiophage and integrative plasmids encode site-specific 
recombination systems that enable the stable incorporation, of their genome into those of 
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their hosts. In these systems, the minimal requirements for the recombination reaction are a 
recombinase enzyme, or integtase, which catalyzes the recombination event, and two 
recombination sites (Sadowski (1986) J. Bacterial. 165: 341-347; Sadowski (1993) FASEB 
J. 7: 760-767). For phage integration systems, these are referred to as attachment (oa) sites, 
with an attP elanent from phage DNA and the attB element encoded by the bacterial 
genome. The two attachmeait sites can share as little sequence identity as a few base pairs. 
The recombinase protein binds to both att sites and catalyzes a conservative and reciprocal 
exchange of DNA strands that result in integration of the circular phage or plasmid DNA 
into host DNA. Additional phage or host factors, such as the DNA bending protein IHF, 
integration /lOst/actor, may be required for an efiBcient reaction (Friedman (1988) Cell 
55:545-554; Finkel & Johnson (1992) Mol. Microbiol. 6: 3257-3265). The reverse excision 
reaction sometimes requires an additional phage factor, such as the xis gene product of phage 
X (Weisberg & Landy (1983) "Site-specific recombination in phage lambda." In Lambda U, 
eds.Hendrixera/. (Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) pp.21 1-250; 
Landy (1989) ^/mi. Rev. Biochem. 58: 913-949. 

The recombinases have been categorized into two groups, the X integrase 
(Argos et al (1986) EMBOJ. 5: 433-44; Voziyanov et al (1999) Nucl. Adds Res. 27: 930- 
941) and the resolvase/invertase (Hatfull & Grindley (1988) "Resolvases and DNA- 
invertases: a family of enzymes active in site-specific recombination" In Genetic 
Recombination, eds. Kucherlipati, R., & Smith, G. R. (Am. Soc. Microbiol., Washington 
DC), pp. 357-396) families. These vary in the stracture of the integrase enzymes and the 
molecular details of their mode of catalysis (Stark et al. (1992) Trends Genetics 8: 432-439). 
The temperate Streptomyces phage OC3 1 encodes a 68 kD recombinase of the latter class. 
The efiScacy of the C>C31 integrase enzyme in recombining its cognate attachment sites was 
recently demonstrated in vitro and in vivo in recA mutant Escherichia coli (Thorpe & Smith 
(1998) Proc. Nat 1 Acad. Sci. USA 95: 5505-5510). The OCS 1 integration reaction is simple 
in that it does not require a host factor and q)pears irreversible, most likely because an 
additional phage protein is required for excision. The phage and bacterial att sites share only 
fliree base pairs of homology at the point of cross-over. This homology is flanked by 
inverted repeats, presumably binding sites for the integrase protein. The minimal known 
fimctional size for both attB and attP is -50 bp. 
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The Cre-fox system of bacteriophage PI, and the FLP-FRT system of 
Saccharomyces cerevisiae have been widely used for transgene and chromosome 
engineering in animals and plants (reviewed by Sauer (1994) Curr, Opin, Biotechnol 5: 521- 
527; Ow (1996) Curr. Opin. Biotechnol 7: 181-186). Other systems that operate in animal 
5 or plant cells include the following: 1) the Vi-RS system from Zygosaccharomyces rouxii 
(Qnouchi et al (1995) Mol Gem Genet 247: 653-660), 2) the Gin-^ system from 
bacteriophage Mu (Maeser & Kahmann (1991) Mol Gen, Genet 230: 170-176) and, 3) the p 
recombinase-ju system from bacterial plasmid pSM19035 (Diaz et al (1999) J. Biol Chem. 
274: 6634-6640). By using the site-specific recombinases, one can obtain a greater frequency 
10 ofintegration. 

However, these five systems suffer Scorn a significant shortcoming. Each of 
these systems have m common the property that a single polypeptide recombinase catalyzes 
the recombination between two sites of identical or nearly identical sequences. The product- 
sites generated by recombination are themselves substrates for subsequent recombination. 
15 Consequently, recombination reactions are readily reversible. Smce the kinetics of 

intramolecular interactions are favored over intermolecular interactions, these recombmation 
systems are efficient for deleting rather than integrating DNA. Thus, a need exists for 
methods and systems for obtaining stable site-specific integration of transgenes. The present 
invention fiilfills this and other needs. 

20 SUMMARY OF THE INVENTION 

The presait invention provides methods for obtaining stable, site-specific 
recombination in a eukaryotic cell. Unlike previously known methods for site-specific 
recombination, the recombinants obtained using the methods of the invention are stable. The 
recombination reaction is not reversible. 

25 The methods involve providing a eukaryotic cell that comprises a first 

recombination site and a second recombination site, which second recombination site can 
serve as a substrate for recombination with the first recombination site. The first and the 
second recombination sites are contacted with a prokaryotic recombinase polypeptide, 
resulting in recombination between the recombination sites, thereby forming one or two 

30 hybrid recombination sites. Significantly, the recombinase polypeptide is one that can 
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mediate site-specific recombination between the first and second recombination sites, but 
cannot mediate recombination between the two hybrid recombination sites in the absence of 
an additional phage-produced factor that is not present in the enkaryotic cell. Either or both 
of the recombination sites can be present in a chromosome of the eukaiyotic cell. In some 
5 embodiments, one of the recombination sites is present in the chromosome and the other is 
included within a nucleic acid that is to be mtegrated into the chromosome. 

The invention also provides eukaryotic cells that contain a prokaiyotic 
recombinase polypeptide or a nucleic acid that encodes a prokaryotic recombinase. In these 
embodiments, the recombinase is one that can mediate site-specific recombination between a 

10 first recombmation site and a second recombination site that can serve as a substrate for 
recombination with the first recombination site, but m the absence of an additional factor 
that is not present in the eukaryotic cell cannot mediate recombination between two hybrid 
recombination sites that are formed upon recombination between the first recombination site 
and the second recombination site. In presently preferred embodiments, the cells of the 

15 invention include a nucleic acid that has a coding sequence for a recombinase polypeptide. 
The recombinase coding sequence is preferably operably linked to a promoter that mediates 
expression of the recombinase-encoding polynucleotide in the eukaryotic cell. The 
eukaryotic cells of the invention can be an animal cell, a plant cell, a yeast cell or a fungal 
cell, for example. 

20 In additional embodiments, the invention provides methods for obtaining a 

eukaiyotic cell having a stably mtegrated transgene. These methods involve introducing a 
nucleic acid into a eukaiyotic cell that comprises a first recombination site, wherein the 
nucleic acid comprises the transgene of interest and a second recombination site which can 
serve as a substrate for recombination with the first recombination site. The first and second 

25 recombination sites are contacted with a prokaryotic recombinase polypeptide. The 
recombmase polypeptide catalyzes recombination between the first and second 
recombination sites, resulting m integration of the nucleic acid at the first recombination site, 
thereby forming a hybrid recombination site at each end of the nucleic acid. Again, the 
recombinase polypeptide is one that can mediate site-specific recombination between the 

30 first and second recombination sites, but cannot mediate recombination between two hybrid 
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recombination sites in the absence of an additional factor that is not present in the eukaryotic 
ceU. 

Additional embodiments of the invention provide nucleic acids that include a 
polynucleotide sequence that encodes a bacterial recombinase polypeptide operably linked to 
5 a promoter that fimctions in a eukaryotic cell. The recombinase polypeptides encoded by 
these nucleic acids of the invention cannot mediate recombination between two hybrid 
recombination sites that are fomied upon recombination between a first recombination site 
and a second recombination site in the absence of a bacteriophage factor that is not present in 
the eukaryotic cells. In some embodiments, the nucleic acids further include at least one 
1 0 recombination site that is recognized by the recombinase polypeptide. 

Also provided by the invention are eukaryotic cells that include a 
polynucleotide that has one or more bacteriophage OC31 recombination sites, or 
recombination sites for other recombinases that cannot mediate recombination between two 
hybrid recombination sites that are formed upon recombination between a first 
15 recombination site and a second recombination site in the absence of a bacteriophage factor 
that is not present in the eukaryotic cells. 

BRIEF DESCRIFnON OF TBQE DRAWINGS 
Figure 1 shows a schematic (not to scale) representation of the chromosome 
structure at the pombe leul locus. Homologous insertion of pLT44 into the chromosome 
20 (Figure 1 A) places a OC3 1 attP target between leul alleles as shown in Figure IB. pLT43 
promoted site-specific integration of pLT45 into the chromosomal attP target leads to the 
structure shown in Figure IC. Arrowheads indicate PGR primers corresponding to the T7 
promoter (T7), T3 promoter (T3) and ura4^ coding region (U4), Predicted sizes ofXbal (X) 
cleavage products are shown. 
25 Figure 2 shows a schematic of an experiment which demonstrated that OC3 1 

integrase catalyzes site-specific integration of a transgene encoding green fluorescent protein 
(GFP) in CHO cells. 

Figure 3 shows a schematic diagram of an experiment which demonstrated 
that <S>C3 1 catalyzes specific recombination at an attB site to insert a hygromycin 
30 phosphotransferase gene downstream of a chromosomally located promoter. Successful 
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integration produces a Pc-attL-hpt linkage and a hygromycin resistance phenotype. The 
effect of different lengths oiattP and attB sites were analyzed using the plasmids indicated. 

Figure 4 shows a schematic diagram of an experiment which demonstrates 
that OCSl integrase catalyzes the excision of a DNA flanked by attB and attl? sites from the 
5 tobacco genome. 

Figure 5 shows a schematic diagram of an experiment in which OC3 1 
integrase was shown to catalyze integration of a transgene into the tobacco genome. 

DETAILED DESCRIPTION 

Definitions 

10 An "exogenous DNA segment", 'Tieterologous polynucleotide" a "transgene" 

or a "heterologous nucleic acid", as used herein, is one that originates from a source foreign 
to the particular host cell, or, if from the same source, is modified from its original form. 
Thus, a heterologous gene in a host cell includes a gene that is endogenoxis to the particular 
host cell, but has been modified. Thus, the terms refer to a DNA segment which is foreign or 

1 5 heterologous to the cell, or homologous to the cell but in a position within the host cell 
nucleic acid in which the element is not ordinarily found. Exogenous DNA segments are. 
expressed to yield exogenous polypeptides. 

The term "gene" is used broadly to refer to any segment of DNA associated 
with a biological function. Thus, genes include coding sequences and/or the regulatory 

20 sequences required for their expression. Genes can also include nonexpressed DNA 

segments that, for example, form recognition sequences for other proteins. Genes can be 
obtained from a variety of sources, including cloning from a source of interest or 
synthesizing from known or predicted sequence information, and may include sequences 
designed to have desired parameters. 

25 The terra "isolated", when applied to a nucleic acid or protein, denotes that 

the nucleic acid or protein is essentially free of other cellular components with which it is 
associated in the natural state. It is preferably in a homogeneous state although it can be in 
either a dry or aqueous solution. Purity and homogeneity are typically determined usiog 
analytical chemistry techniques such as polyacrylamide gel electrophoresis or high 

30 performance liquid chromatography. A protein which is the predominant species present in a 
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prq)aration is substantially purified. In particular, an isolated gene is separated fix>m open 
reading frames which flank the gene and encode a protein other than the gene of interest. 
The term **purified** denotes that a nucleic add or protein gives rise to essentially one band 
in an electrophoretic gel. Particularly, it means that die nucleic acid or protein is at least 
5 about 50% pure, more preferably at least about 85% pure, and most preferably at least about 
99% pure. 

The tenn ^'naturally-occurring'* is used to describe an object that can be found 
in nature as distinct from being artificially produced by man. For example, a polypeptide or 
polynucleotide sequence that is present in an organism (including viruses) that can be 

1 0 isolated fix)m a source in nature and which has not been intentionally modified by man in the 
laboratory is naturally-occurring. 

The tenn "nucleic acid" or '^polynucleotide" refers to deoxyribonucleotides 
or ribonucleotides and polymers thereof in either single- or double-stranded form. Unless 
specifically limited, the term encompasses nucleic acids containing known analogues of 

15 natural nucleotides which have sinular binding properties as the reference nucleic acid and 
are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise 
indicated, a particular nucleic acid sequence also impUcitly encompasses conservatively 
modified variants thereof (e.^. degenerate codon substitutions) and complementary 
sequences and as well as the sequence exphcitly indicated. Specifically, degenerate codon 

20 substitutions may be achieved by generatmg sequences in which the third position of one or 
more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues 
(Batzer et al. (1991) Nucleic Acid Res, 19: 5081; Ohtsuka et al (1985) 1 Biol Chem. 260: 
2605-2608; Cassol et al. (1992) ; Rossolini et al (1994) Mol Cell Probes 8: 91-98). The 
term nucleic add is used interchangeably wifli gene, cDNA, and mRNA encoded by a gene. 

25 "Nucleic acid derived fix)m a gene" refers to a nucleic acid for whose 

synthesis a gene, or a subsequence thereof (eg:, coding region), has ultimately served as a 
template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA transcribed 
fipom that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified 
DNA, etc., are all derived from the gene and detection of such derived products is indicative 

30 of the presence and/or abimdance of the original. 
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A DNA segment is "operably linked" when placed into a functional 
relationship with another DNA segment For example, DNA for a signal sequence is 
operably Imked to DNA encoding a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypq)tide; a promote: or enhancer is operably linked to 
5 a coding sequence if it stimulates the transcription of the sequence. Generally, DNA 

sequences that are operably linked are contiguous, and in the case of a signal sequence both 
contiguous and in reading phase. However, enhancers, for example, need not be contiguous 
with the coding sequences whose transcription they control. Linking is accomplished by 
ligation at convenient restriction sites or at adapters or linkers inserted in lieu thereof. 

10 'Tlant** includes whole plants, plant organs (eg., leaves, stems, roots, etc.), 

seeds and plant cells and progeny of same. The class of plants that can be used in the 
methods of the invention is generally as broad as the class of higher plants amenable to 
transformation techniques, including both monocotyledonous and dicotyledonous plants. 
"Promoter'' refers to a region of DNA involved in binding the RNA 

1 5 polymerase to initiate transcription. An "inducible promoter'* refers to a promoter that directs 
expression of a gene where the level of expression is alterable by environmental or 
developmental factors such as, for example, temperature, pH, transcription factors and 
chemicals. 

The term "recombinant" when used with reference to a cell indicates that the 
20 cell replicates a heterologous nucleic acid, or expresses a peptide or protein encoded by a 
heterologous nucleic acid. Recombinant cells can contain polynucleotides that are not found 
within the native (non-recombinant) form of the cell. Recombinant cells can also contain 
polynucleotides found in the native form of the cell wherein the polynucleotides are 
modified and re-introduced into the cell by artificial means. The term also encompasses cells 
25 that contain a nucleic acid endogmous to the cell that has been modified without removing 
the nucleic acid from the cell; such modifications include those obtained by gene 
replacement, site-specific mutation, and related techniques. 

A **recombinant expression cassette" or simply an "expression cassette" is a 
nucleic acid construct, generated recombinantly or synthetically, with nucleic acid elements 
30 that are cq)able of effecting expression of a structural gene in hosts compatible with such 
sequences. Expression cassettes include at least promoters and optionally, transcription 
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tennination signals. Typically, the recombinant expression cassette includes a nucleic acid 
to be transcribed a nucleic acid encoding a desired polypeptide), and a promoter. 
Additional factors necessary or helpful in efifecting expression may also be used as described 
herein. For example, an expression cassette can also include nucleotide sequences that 
5 encode a signal sequence that directs secretion of an expressed protein from the host cell. 
Transcription tennination signals, enhancers, and other nucleic acid sequences that influence 
gene expression, can also be included in an expression cassette. 

**Recombinase" refers to an enzyme that catalyzes recombination between 
two or more recombination sites. Recombinases useful in the present invention catalyze 
1 0 recombination at specific recombination sites which are specific polynucleotide sequences 
that are recognized by a particular recombinase. The term "iategrase" refers to a type of 
recombinase. 

'Transformation rate" refers to the percent of cells that successfully 
incorporate a heterologous polynucleotide into its genome and survive. 

1 5 The term '^transgenic" refers to a cell that includes a specific modification that 

was introduced into the cell, or into an ancestor of the cell. Such modifications can include 
one or more point mutations, deletions, insertions, or combinations thereof. When referring 
to an animal, the term **transgenic" means that the animal includes cells that are transgenic. 
An animal that is composed of both transgenic and non-transgenic cells is referred to herein 

20 as a "chimeric" animal. 

The term 'Vector" refers to a composition for transferring a nucleic acid (or 
nucleic acids) to a host cell. A vector comprises a nucleic acid encoding the nucleic acid to 
be transferred, and optionally comprises a viral cq>sid or other materials for faciUtating entiy 
of the nucleic acid into the host cell and/or replication of the vector in the host cell (e.g., 

25 reverse transcriptase or other enzymes which are packaged within the capsid, or as part of 
the capsid). 

'•Recombination sites" are specific polynucleotide sequences that are 
recognized by the recombinase enzymes described herein. Typically, two different sites are 
involved (termed "complementary sites"), one present in the target nucleic acid (e.g., a 
30 chromosome or episome of a eukaiyote) and another on the nucleic acid that is to be 
integrated at the target recombination site. The terms "a//S" and "arrP," which refer to 
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attachment (or recombination) sites originally from a bacterial target and a phage donor, 
respectively, are used herein although recombination sites for particular enzymes may have 
dififerent names. The recombination sites typically include left and right arms separated by a 
core or spacer region. Thus, an attB recombination site consists of BOB', where B and B' are 
5 the left and right arms, respectively, and O is the core region. Similarly, attP is POP', where 
P and P' are the aims and O is again the core region. Upon recombination between the attB 
and attP sites, and concomitant integration of a nucleic acid at the target, the recombination 
sites that flank the integrated DNA are referred to as ''attL" and The attL and attR 

sites, using the temiinology above, thus consist of BOP' and POB', respectively. In some 
10 representations herein, the "O" is omitted and attB and attP, for example, are designated as 
BB' and PP', respectively. 

Description of the Preferred Embodiments 

The present invention provides methods for obtaining site-specific 
recombination in e\ikaiyotic cells. Unlike previously known systems for obtaining site- 

1 5 specific recombination, the products of the recombinations performed using the methods of 
the invention are stable. Thus, one can use the methods to, for example, introduce transgenes 
into chromosomes of eukaryotic cells and avoid the excision of the transgene that often 
occurs using previously known site-specific recombination systems. Stable inversions, 
translocations, and other rearrangements can also be obtained. 

20 The invention employs prokaryotic recombinases, such as bacteriophage 

integrases, that are unidirectional in that they can catalyze recombination between two 
complementary recombination sites, but caimot catalyze recombination between the hybrid 
sites that are formed by this recombination. One such recombinase, the OC3 1 integrase, by 
itself catalyzes only the attB x attP reaction. The integrase cannot mediate recombination 

25 between the attL and attR sites that are formed upon recombination between attB and attP, 
Because recombinases such as the <DC31 integrase cannot alone catalyze the reverse 
reaction, the OC3 1 attB x attP recombination is stable. This property is one that sets the 
methods of the present invention apart fix)m site-specific recombination systems currently in 
use for eucaryotic cells, such as the Cre-/ojr or FLP-FifJ system, where the recombination 

30 reactions can readily reverse. Use of the recombination systems of the invention provides 
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new opportunities for directing stable transgme and chromosome reairangements in 
eukaryotic cells. 

The methods involve contacting a pair of recombination sites {e.g,, atiB and 
attP) that are present in a eukaryotic cell with a corresponding recombinase. The 
5 recombinase then mediates recombination between the recombination sites. Depending upon 
the relative locations of the two recombination sites, any one of a number of events can 
occur as a result of the recombination. For example, if the two recombination sites are 
present on different nucleic acid molecules, the recombination can result in integration of 
one nucleic acid molecule into a second molecule. Thus, one can obtain integration of a 

1 0 plasmid that contains one recombination site into a eukaryotic ceU chromosome that includes 
the corresponding recombination site. Because the recombinases used in the methods of the 
invention cannot catalyze the reverse reaction, the integration is stable. Such methods are 
useful, for example, for obtaining stable integration into the eukaryotic chromosome of a 
transgene that is present on the plasmid. 

1 5 The two recombination sites can also be present on the same nucleic acid 

molecule. In such cases, the resulting product typically depends upon the relative orientation 
of the sites. For example, recombination between sites that are in the direct orientation will 
generally result in excision of any DNA that lies between the two recombination sites. In 
contrast, recombination between sites that are in the reverse orientation can result in 

20 inversion of the intervening DNA. Again, the resulting rearranged nucleic acid is stable in 
that the recombination is irreversible in the absence of an additional factor, generally 
encoded by the particular bacteriophage fiom which the recombinase is derived, that is not 
normally found in eukaryotic cells. One example of an ^plication for which this method is 
useful involves the placement of a promoter between the two recombination sites. If the 

25 promoter is initially in the opposite orientation relative to a coding sequence that is to be 
expressed by the promoter and the recombination sites tfiat flank the promoter are in the 
inverted orientation, contacting the recombination sites will result m inversion of the 
promoter, thus placing the promoter m the correct orientation to drive rapression of the 
coding sequence. Sunilarly, if the promoter is initially in the correct orientation for 

30 expression and the recombination sites are in the same orientation, contacting the 
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recombination sites with the promoter can result in excision of the promoter fragment, thus 
stopping expression of the coding sequence. 

The methods of the invention are also useful for obtaining translocations of 
chromosomes, for example. In these embodiments, one recombination site is placed on one 

5 chromosome and a second recombination site that can serve as a substrate for recombination 
with the first recombination site is placed on a second chromosome. Upon contacting the two 
recombination sites with a recombinase, recombination occurs that results in swapping of the 
two chromosome anns. For example, one can construct two strains of an organism, one 
strain of which includes the first recombination site and the second strain that contains the 

10 second recombination site. The two strains are then crossed, to obtain a progeny strain that 
includes both of the recombination sites. Upon contacting the sites with the recombinase, 
chromosome ann swapping occurs. 

Recombinases and Recombination Sites 

The methods of the invention use recombinase systems to achieve stable 
1 5 integration or other rearrangement of nucleic acids in eukaryotic cells. A recombinase 
system typically consists of three elements: two specific DNA sequences C*the 
recombination sites") and a specific enzyme C*the recombinase'*). The recombinase catalyzes 
a recombination reaction betwera the specific recombination sites. 

Recombination sites have an orientation. La other words, they are not 
20 palindromes. The orientation of the recombination sites in relation to each other determines 
what recombmation event takes place. The recombination sites may be in two different 
orientations: parallel (same direction) or opposite. When the recombination sites are present 
on a single nucleic acid molecule and are in a parallel orientation to each other, then the 
recombination event catalyzed by the recombinase is a typically an excision of the 
25 intervening nucleic acid, leaving a single recombination site. When the recombination sites 
are in the opposite orientation, then any intervening sequence is typically inverted. 

The recombinases used in the methods of the invention can mediate site- 
specific recombination between a first recombination site and a second recombination site 
that can serve as a substrate for recombination with the first recombination site. However, m 
30 the absence of an additional factor that is not normally present in eukaryotic cells, cannot 
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mediate recombinatioii between two hybrid recombination sites that are formed i^on 
recombination between the first recombination site and the second recombination site. 
Examples of these recombinases include, for example, the bacteriophage OC31 integrase 
(see, e.g,, Thorpe & Smith (1998) Proa Natl Acad ScL USA 95: 5505-5510; Kuhstoss & 
5 Rao (1991) J. MoL Biol 222: 897-890; US Patent No. 5,190,871), a phage P4 recombinase 
(Ow & Ausubel (1983) J. BacterioL 155: 704-713), a Listeria phage recombinase, a 
bacteriophage R4 Sre recombinase (Matsuura et al (1996) /. BacterioL 178: 3374-3376), a 
CisA recombinase (Sato etal (1990)7. BacterioL 172: 1092-1098; Stragier er a/. (1989) 
Science 243: 507-512), an XisF recombinase (Carrasco et aL (1994) Genes Dev. 8: 74-83), 

10 and a transposon ln4451 TnpX recombinase (Bannam et aL (1995) MoL MicrobioL 16: 535- 
551; Crelin & Rood (1997) J. BacterioL 179: 5148-5156). 

Recombinase polypeptides, and nucleic acidfe that encode the recombinase 
polypeptides, are described in the art and can be obtained using routine methods. For 
example, a vector that includes a nucleic acid fiagment that encodes the OC3 1 integrase is 

15 described in US Patent No. 5,190,871 and is available from the Northern Regional Research 
Laboratories, Peoria, Illinois 61604) under the accession number B- 18477. 

The recombinases can be introduced into the eukaryotic cells that contain the 
recombination sites at which recombination is desired by any suitable method. For example, 
one can introduce the recombinase in polypeptide form, e.g.y by microinjection or other 

20 methods. In presently preferred embodiments, however, a gene that encodes the recombmase 
is introduced into the cells. Expression of the gene results in production of the recombinase, 
which then catalyzes recombination among the corresponding recombination sites. One can 
introduce the recombinase gene into the cell before, after, or simultaneously with, the 
introduction of the exogenous polynucleotide of interest. In one embodiment, the 

25 recombinase gene is present withiii the vector that carries the polynucleotide that is to be 
inserted; the recombinase gene can even be included within the polynucleotide. In other 
embodiments, the recombinase gene is introduced into a transgenic eukaryotic organism, 
e.g., a transgenic plant, animal, fimgus, or the like, which is then crossed with an organism 
that contains the corresponding recombination sites. 
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Target Organisms 

The methods of the invention are useful for obtaining stable integration 
and/or reairangement of DNA in any type of eukaryotic cell. For example, the methods are 
useful for cells of animals^ plants, fungi, bacteria and other microorganisms. In some 
5 embodiments, the cells are part of a multicellular organism, e.g., a transgenic plant or 
animal. The methods of the invention are particularly useful in situations where transgenic 
materials are difficult to obtain, such as with transgenic wheat, com, and animals. In these 
situations, finding the rare single copy insertion requires the prior attainment of a large 
number of independently derived transgenic clones, which itself requires great expenditure 
10 ofefifort. 

Among the plant targets of particular interest are monocots, including, for 
example, rice, com, wheat, rye, barley, bananas, pabns, lilies, orchids, and sedges. Dicots are 
also suitable targets, including, for example, tobacco, apples, potatoes, beets, carrots, 
willows, elms, maples, roses, buttercups, petunias, phloxes, violets and sunflowers. Other 
15 targets include animal and fungal cells. These lists are merely illustrative and not limiting. 

Constructs for Introduction of Exogenous DNA into Target Cells 

The methods of the invention often involve the introduction of exogenous 
DNA into target cells. For example, nucleic acids that include one or more recombination 
sites are often introduced into the cells. The polynucleotide constructs that are to be 
20 introduced into the cells can include, in addition to the recombination site or sites, a gene or 
other functional sequence that will confer a desired phenotype on the cell. 

In some embodiments, a polynucleotide construct that encodes a recombinase 
is introduced into the eukaryotic cells in addition to the recombination sites. The 
recombinase-encoding polypeptide can be included on the same nucleic acid as the 
25 recombination site or sites, or can be introduced into the cell as a separate nucleic acid. The 
present invention provides nucleic acids that include recombination sites, as well as nucleic 
acids in which a recombinase-encoding polynucleotide sequence is operably linked to a 
promoter that functions in the target eukaryotic cell. 

Generally, a polynucleotide that is to be expressed (e.g. , a recombinase- 
30 encoding polynucleotide or transgene of interest) will be present in an expression cassette, 
meaning that the polynucleotide is operably linked to expression control signals, e.g., 
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promoters and terminators, that are functional in the host cell of interest. The genes that 
encode the recombinase and the selectable maricer, will also be under the control of such 
signals that are functional in the host cell. Control of expression is most easily achieved by 
selection of a promoter. The transcription terminator is not generally as critical and a variety 
5 of known elements may be used so long as they are recognized by the cell. 

A promoter can be derived from a gene that is under investigation, or can be a 
heterologous promoter that is obtained from a diffoent gene, or from a different species. 
Where direct expression of a gene in all tissues of a transgenic plant or other organism is 
desired, one can use a "constitutive" promoter, which is generally active under most 

10 environmental conditions and states of development or cell differentiation. Suitable 

constitutive promoters for use in plants include, for example, the cauliflower mosaic virus 
(CaMV) 35S transcription initiation region and region VI promoters, the 1- or 2 - promoter 
derived from T-DNA oiAgrobacterium ttmefaciens, and other promoters active in plant 
cells that are known to those of skill in the art. Other suitable promoters include the fiiU- 

1 5 length transcript promoter from Figwort mosaic virus, actin promoters, histone promoters, 
tubulin promoters, or the maimopine synthase promoter (MAS). Other constitutive plant 
promoters include various ubiquitin or polyubiquitin promoters derived from, inter alia^ 
Arabidopsis (Sun and CaUis, Plant J., 11(5):1017-1027 (1997)), the mas, Mac or DoubleMac 
promoters (described in United States Patent No, 5,106,739 and by Comai et aL, Plant Mol 

20 Biol 15:373-381 (1990)) and other transcription initiation regions from various plant genes 
known to those of skill in the art Such genes include for example, ACTll bom Arabidopsis 
(Huang etal. Plant Mol Biol 33:125-139 (1996)), Cat3 torn Arabidopsis (GenBankNo. 
U43147, Zhong et al, Mol Gen. Genet 25 1 : 196-203 (1996)), the gene encoding stearoyl- 
acyl carrier protein desaturase from Brassica napus (Genbank No. X74782, Solocombe et 

25 al. Plant Physiol 104:1 167-1 176 (1994)), GPcl from maize (GenBank No. X15596, 
Martinez et al, J. Mol Biol 208:551-565 (1989)), and Gpc2 from maize (GenBank No. 
U45855, Manjunath et al. Plant Mol Biol 33:97-1 12 (1997)). Useful promoters for plants 
also include those obtained from Ti- or Ri-plasmids, from plant cells, plant viruses or other 
hosts whore the promoters are found to be functional in plants. Bacterial promoters that 

30 function in plants, and thus are suitable for use in the methods of the invention include the 
octopine synthetase promoter, the nopaline synthase promoter, and the manopine synthetase 
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promoter. Suitable endogenous plant promoters include the ribulose-l,6-biphosphate 
(RUBP) carboxylase small subunit (ssu) promoter, the (a-conglycinin promoter, the 
phaseolin promoter, the ADH promoter, and heat-shock promoters. 

Promoters for use in E. coli mclude the T7, trp, or lambda promoters, a 
5 ribosome binding site and preferably a transcription termination signal. For eukaryotic cells, 
the control sequences typically include a promoter which optionally includes an enhancer 
derived from immunoglobulin genes, SV40, cytomegalovirus, e/c, and apolyadenylation 
sequence, and may include splice donor and acceptor sequences. In yeast, convenient 
promoters include GALMO (Johnson and Davies (1984) Mol Cell Biol 4: 1440-1448) 

10 ADH2 (Russell et al (1983) 7. Biol Chem, 258:2674-2682), PH05 (EMBOJ. (1982) 6:675- 
680), and MFa (Herskowitz and Oshima (1982) in TTie Molecular Biology of the Yeast 
Saccharomyces (eds. Strathem, Jones, and Broach) Cold Spring Harbor Lab., Cold Spring 
Harbor, N.Y., pp. 181-209). 

Alternatively, one can use a promoter that directs expression of a gene of 

1 5 interest ui a specific tissue or is otherwise under more precise environmental or 

developmental control. Such promoters are referred to here as "inducible" or '*repressible" 
promoters. Examples of environmental conditions that may effect transcription by inducible 
promoters include pathogen attack, anaerobic conditions, ethylene or the presence of light. 
Promoters under developmental control include promoters that initiate transcription only in 

20 certain tissues, such as leaves, roots, fiuit, seeds, or flowers. The operation of a promoter 
may also vary depending on its location in the genome. Thus, an inducible promoter may 
become fully or partially constitutive in certain locations. Inducible promoters are often used 
to control expression of the recombinase gene, thus allowing one to control the timing of the 
recombination reaction. Examples of tissue-specific plant promoters under developmental 

25 control include promoters that initiate transcription only in certain tissues, such as fruit, 
seeds, or flowers. The tissue-specific E8 promoter firom tomato is particularly useful for 
directing gene expression so that a desired gene product is located in fiuits. See, e,g,, Lincoln 
etal (mS) Proa Natl Acad &/. USA 84: 2793-2797; DeikmaneM/. (1988) £MBO/. 7: 
3315-3320;Deikmane/a/.(1992)/^/a/irP/i;^5/o/, 100:2013-2017. Other suitable promoters 

30 include those from genes encoding embryonic storage proteins. Examples of environmental 
conditions that may affect transcription by inducible promoters include anaerobic conditions. 
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elevated temperature, or the presence of light. Additional organ-specific, tissue-specific 
and/or inducible foreign promoters are also known {see, e.g,y references cited in Kuhlemeier 
et al (1987) Ann, Rev. Plant Physiol 38:221), including those 1,5-ribulose bisphosphate 
carboxylase small subunit genes of Arabidopsis thaliana (the "ssu" promoter), which are 
5 light-inducible and active only in photosynthetic tissue, anther-specific promoters (EP 

344029), and seed-specific promoters of, for example, Arabidopsis thaliana (Krebbers et al 
(1988) Plant Physiol 87:859). Exemplary green tissue-specific promoters include the maize 
phosphoenol pyruvate caiboxylase (PEPC) promoter, small submit ribulose bis-carboxylase 
promoters (ssRUBISCO) and the chlorophyll a/b binding protein promoters. The promoter 

1 0 may also be a pith-specific promoter, such as the promoter isolated fix>m a plant TrpA gene 
as described in International Publication No. W093/07278. 

Inducible promoters for other organisms include, for example, the arabinose 
promoter, the lacZ promoter, the metallothionein promoter, and the heat shock promoter, as 
well as many others that are known to those of skill in the art. An example of a repressible 

15 promoter usefiil in yeasts such as S. pombe is the Pmnt promoter, which is repressible by 
vitamin BL 

Typically, constructs to be introduced into these cells are prepared using 
recombinant expression techniques. Recombinant expression techniques involve the 
construction of recombinant nucleic acids and the expression of genes in transfected cells. 

20 Molecular cloning techniques to achieve these ends are known in the art. A wide variety of 
cloning and in vitro amplification methods suitable for the construction of recombinant 
nucleic acids are well-known to persons of skill. Examples of these techniques and 
instructions sufficient to direct persons of skill through many cloning exercises are found in 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology^ 

25 Volume 152, Academic Press, Inc., San Diego, CA (Berger); and Current Protocols in 
Molecular Biology, F.M. Ausubel et al, eds.. Current Protocols^ a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1998 Supplement) 
(Ausubel). 

The construction of polynucleotide constructs generally requires the use of 
30 vectors able to repUcate in bacteria. A plethora of kits are commercially available for the 
purification of plasmids 6com bacteria. For their proper use, follow the manufacturer's 

17 
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instructions {see, for example, EasyPrepJ, FlexiPrepJ, both from Pharmacia Biotech; 
StrataCleanJ, from Stiatagene; and, QIAexpress Expression System, Qiag^). The isolated 
and purified plasmids can then be further manipulated to produce other plasmids, used to 
transfect cells or incorporated into Agrobacterium tumefaciens to infect and transform 
5 plants. Where Agrobacterium is the means of transformation, shuttle vectors are constructed. 
Cloning in Streptomyces or Bacillus is also possible. 

Selectable markers are often incorporated into the polynucleotide constructs 
and/or into the vectors that are used to introduce the constructs into the target cells. These 
markers permit the selection of colonies of cells containing the polynucleotide of interest. 

10 Often, the vector will have one selectable marker that is flmctional in, e.g.y E. colij or other 
cells in which the vector is repUcated prior to being introduced into the target cell Examples 
of selectable markers for coli include: genes specifying resistance to antibiotics, z.e., 
ampicillin, tetracycline, kaiuimycin, erythromycin, or genes conferring other types of 
selectable enzymatic activities such as P-galactosidase, or the lactose operon. Suitable 

15 selectable markers for use in mammalian cells include, for example, the dihydrofolate 
reductase gene (DHFR), the thymidine kinase gene (TK), or prokaryotic genes conferring 
drug resistance, gpt (xanthine-guanine phosphoribosyltransferase, which can be selected for 
with mycophenolic acid; neo (neomycin phosphotransferase), which can be selected for with 
G418, hygromycin, orpuromycin; and DHFR (dihydrofolate reductase), which can be 

20 selected for with methotrexate (Mulligan & Berg (1981) Proc. Natl Acad, Sci. USA 78: 
2072; Southern & Berg (1982) /. MoL Appl Genet. 1: 327). 

Selection markers for plant cells often confer resistance to a biocide or an 
antibiotic, such as, for example, kanamycin, G 418, bleomycin, hygromycin, or 
chloramphenicol, or herbicide resistance, such as resistance to chlorsulftiron or Basta. 

25 Examples of suitable coding sequences for selectable markers are: the neo gene which codes 
for the enzyme neomycin phosphotransferase which confers resistance to the antibiotic 
kanamycin (Beck et al (1982) Gene 19:327); the kyg (hpt) gene, which codes for the enzyme 
hygromycin phosphotransferase and confers resistance to the antibiotic hygromycin (Gritz 
and Davies (1983) Gene 25:179); and the bar gene (EP 242236) that codes for 

30 phosphinothricin acetyl transferase which confers resistance to the herbicidal compounds 
phosphinothricin and bialaphos. 
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If more than one exogenous nucleic acid is to be introduced into a target 
eukaryotic cell, it is generally desirable to use a different selectable marker on each 
exogenous nucleic acid. This allows one to simultaneously select for cells that contain both 
of the desired exogenous nucleic acids. 

5 Methods for Introducing Constructs into Target Cells 

The polynucleotide constructs that include recombination sites and/or 
recombinase-encoding genes can be introduced into the target cells and/or organisms by any 
of the several means known to those of skill in the art. For instance, the DNA constructs can 
be introduced into plant cells, either in culture or in the organs of a plant by a variety of 

10 conventional techniques. For example, the DNA constructs can be introduced directly to 
plant cells using biolistic methods, such as DNA particle bombardment, or the DNA 
construct can be introduced using techniques such as electroporation and microinjection of 
plant cell protoplasts. Particle-mediated transformation techniques (also known as 
*l)iolistics*') are described in Klem et al, Nature, 327:70-73 (1987); Vasil, V. et al, 

15 Bio/Technol 1 1 :1553-1558 (1993); and Becker, D. et al. Plant J., 5:299-307 (1994). These 
methods involve penetration of cells by small particles with the nucleic acid either within the 
matrix of small beads or particles, or on the surface. The biolistic PDS-1000 Gene Gun 
(Biorad, Hercules, CA) uses heUum pressure to accelerate DNA-coated gold or tungsten 
microcarriers toward target cells. The process is applicable to a wide range of tissues and 

20 cells firom organisms, including plants, bacteria, fungi, algae, intact animal tissues, tissue 
culture cells, and animal embryos. One can employ electronic pulse delivery, which is 
essentially a mild electroporation format for live tissues in animals and patients. Zhao, 
Advanced Drug Delivery Reviews 17:257-262 (1995). 

Other transformation methods are also known to those of skill in the art 

25 Microinjection techniques are known in the art and well described in the sci^tific and patent 
literature. The introduction of DNA constructs using polyethylene glycol (PEG) precipitation 
is described in Paszkowski et ah, EMBO J. 3:2717 (1984). Electroporation techniques are 
described in Fromm et aL, Proc. Natl Acad. Scl USA, 82:5824 (1985). PEG-mediated 
transformation and electroporation of plant protoplasts are also discussed in Lazzeri, P., 

30 Methods Mol Biol 49:95-106 (1995). Methods are known for introduction and expression of 
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heterologous genes in both monocot and dicot plants. See, e.g., US Patent Nos. 5,633,446, 
5,317,096, 5,689.052, 5,159,135, and 5,679,558; Weising etal (1988) ^/iw. Rev. Genet 
ll'All-An. Transfonnation of monocots in particular can use various techniques including 
electroporation {e.g., Shimamoto et al. Nature (1992), 338:274-276); biolistics (e.g., 
5 European Patent Application 270,356); and Agrobacterium (e.g., Bytebier et a/., Proc, Natl 
Acad. Sci, USA (1987) 84:5345-5349). 

For transformation of plants, DNA constructs may be combined with suitable 
T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host 
vector. The virulence functions of the A. tumefaciens host will direct the insertion of a 

10 transgene and adjacent marker gene(s) (if present) into the plant cell DNA when the cell is 
infected by the bacteria. Agrobacterium /wm^c/e/ty-meditated transformation techniques 
are well described in the scientific literature. See, for example, Horsch et al Science, 
233:496-498 (1984), Fraley et al, Proc, Natl Acad. Sci. USA, 80:4803 (1983), and 
Hooykaas, Plant Mol Biol, 13:327-336 (1989), Bechtold et al, Comptes Rendus De L 

1 5 Academic Des Sciences Serie lii-Sciences De La Vie-Life Sciences, 316: 1 194-1 199 (1993), 
Valvekens et al, Proc. Natl Acad. ScL USA, 85:5536-5540 (1988). For a review of gene 
transfer methods for plant and cell cultures, see, Fisk et al, Scientia Horticulturae 55:5-36 
(1993) and Potrykus, CIBA Found Symp. 154:198 (1990). 

Other methods for delivery of polynucleotide sequences into cells include, for 

20 example liposome-based gene deUvery (Debs and Zhu (1993) WO 93/24640; Mannino and 
Gould-Fogerite {\9%i) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; Brigham 
(1991) WO 91/06309; and Feigner et al (1987) Proc Natl Acad Sci USA 84: 7413-7414), 
as well as use of viral vectors {e.g., adenoviral {see, e,g., Bems et al (1995) Ann. NY Acad 
Sci 772: 95-104; Ali et al (1994) Gene Ther. 1: 367-384; and Haddada et al (1995) Curr. 

25 Top. Microbiol Immunol 199 ( Pt 3): 297-306 for review), papillomaviral, retroviral {see, 
e.g., Buchscher et al (1992) 1 Virol 66(5) 2731-2739; Johann et al (1992) J. Virol 66 
(5):1635-1640 (1992); Sommerfelt et al, (1990) Virol 176:58-59; Wilson et al (1989)/. 
Virol 63:2374-2378; MiUer et al., J. Virol 65:2220-2224 (1991); Wong-Staal et al, 
PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third 

30 Edition Paul (ed) Raven Press, Ltd., New York and the references therein, and Yu et al. 
Gene Therapy (1994) supra.), and adeno-associated viral vectors {see. West et al (1987) 

20 
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Virology 160:38-47; Carter et al (1989) U.S. Patent No. 4,797,368; Carter et al WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) 1 Clin, 
Invst 94:1351 and Samulski (supra) for an overview of AAV vectors; see also^ Lebkowski, 
U.S. Pat. No. 5,173,414; Tratschin et al (1985) Mol Cell Biol 5(1 1):3251-3260; Tratschin 
5 et al. (1984) Mol Cell Biol, 4:2072-2081; Hennonat and Muzyczka (1984) Proc, Natl 
Acad. Sci. USA, 81 :6466-6470; McLaughlin et al (1988) and Samulski et al (1989) /. 
Virol, 63:03822-3828), and the like. 

Methods by which one can analyze the integration pattern of the introduced 
exogenous DNA are well known to those of skill in the art. For example, one can extract 
10 DNA from the transformed cells, digest the DNA with one or more restriction enzymes, and 
hybridize to a labeled fragment of the polynucleotide construct. The inserted sequence can 
also be identified using the polymerase chain reaction (PGR). See, e.g., Sambrook et al. 
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Haibor, New York, 1989 for descriptions of these and other suitable methods. 

1 5 Regeneration of Transgenic Plants and Animals 

The methods of the invention are particularly usefiil for obtaining transgenic 
and chimeric multicellular organisms that have a stably integrated exogenous polynucleotide 
or other stable rearrangement of cellular nucleic acids. Methods for obtaining transgenic and 
chimeric organisms, both plants and animals, are well known to those of skill in the art. 

20 Transformed plant cells, derived by any of the above transformation 

techniques, can be cultured to regenerate a whole plant which possesses the transformed 
genotype and thus the desired phenotype. Such regeneration techniques rely on 
manipulation of certain phytohoimones in a tissue culture growth medium, typically relying 
on a biocide and/or herbicide marker which has been introduced together with the desired 

25 nucleotide sequences. Plant regeneration from cultured protoplasts is described in Evans et 
al. Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp. 124-176, 
Macmillian Publishing Company, New York (1983); and in Binding, Regeneration of 
Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, (1985). Regeneration can also 
be obtained from plant callus, explants, somatic embryos (Dandekar et al, J. Tissue Cult. 

30 Metk, 12:145 (1989); McGranahan et al. Plant Cell Rep., 8:512 (1990)), organs, or parts 
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thereof. Such regeneration techniques are described generally in Klee et al, Ann, Rev. of 
Plant Phys,, 38:467-486 (1987). 

The methods are useful for producing transgenic and chimeric animals of 
most vertebrate species. Such species include, but are not limited to, nonhuman mammals, 
5 including rodents such as mice and rats, rabbits, ovines such as sheep and goats, porcines 
such as pigs, and bo vines such as cattle and buffalo. Methods of obtaining transgenic 
animals are described in, for example, Puhler, A., Ed., Genetic Engineering of Animals^ 
VCH Publ, 1993; Murphy and Carter, Eds., Transgenesis Techniques : Principles and 
Protocols (Methods in Molecular Biology, Vol. 18), 1993; and Pinkert, CA, Ed., Transgenic 
1 0 Animal Technology : A Laboratory Handbook, Academic Press, 1 994. Transgenic fish 
having specific genetic modifications can also be made using the claimed methods. See, 
eg., Iyengar et al (1996) Transgenic Res. 5: 147-166 for general methods of making 
transgenic fish. 

One method of obtaining a transgenic or chimeric animal having specific 

1 5 modifications in its genome is to contact fatilized oocytes with a vector that includes the 
polynucleotide of interest flanked by recombination sites. For some animals, such as mice 
fertilization is performed in vivo and fertilized ova are surgically removed. In other animals, 
particularly bovines, it is preferably to remove ova firom live or slaughterhouse animals and 
fertilize the ova in vitro. See DeBoer et al, WO 91/08216. In vitro fertilization pennits the 

20 modifications to be introduced into substantially synchronous cells. Fertilized oocytes are 
then cultured in vitro until a pre-implantation embryo is obtained containing about 16-150 
cells. The 16-32 cell stage of an embryo is described as a morula. Pre-implantation 
embryos containing more than 32 cells are termed blastocysts. These embryos show the 
development of a blastocoel cavity, typically at the 64 cell stage. If desired, the presence of 

25 a desired exogenous polynucleotide in the embryo cells can be detected by methods known 
to fliose of skill in the art Methods for culturing fertilized oocytes to the pre-implantation 
stage are described by Gordon et al (1984) Methods EnzymoL 101: 414; Hogan et al. 
Manipulation of the Mouse Embryo: A Laboratory Manual, C.S.HX. N.Y. (1986) (mouse 
embryo); Hanmier et al. (1985) Nature 3 1 5: 680 (rabbit and porcine embryos); Gandolfi et 

30 aL (1987)7. Reprod Pert. 81: 23-28;Rexroade/a/. (1988)/.^n/w, ScL 66: 947-953 (ovine 
embryos) and Eyestone et aL (1989) J. Reprod, Pert, 85: 715-720; Camous et al. (1984) /. 
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Reprod Pert. 72: 779-785; and Heyman et ah (1987) Theriogenology 27: 5968 (bovine 
embryos). Sometimes pre-implantation embryos are stored fix)zen for a period pending 
implantation. Pre-implantation embryos are transferred to an ^propriate female resulting in 
the birth of a transgenic or chimeric animal depending upon the stage of development when 
5 the transgene is integrated. Chimeric mammals can be bred to form true germline transgenic 
animals. 

Alternatively, the methods can be used to obtain embryonic stem cells (ES) 
that have a single copy of the desired exogenous polynucleotide. These cells are obtained 
from preimplantation embryos cultured in vitro. See, e,g., Hooper, ML, Embryonal Stem 

10 Cells : Introducing Planned Changes into the Animal Germline (Modem Genetics, v. 1), 
IntM. Pub. Distrib., Inc., 1993; Bradley et al (1984) Nature 309, 255-258. Transformed ES 
cells are combined with blastocysts from a non-human animal. The ES cells colonize the 
embryo and in some embryos form the genn line of the resulting chimeric animal. See 
Jaenisch, Science, 240: 1468-1474 (1988). Alternatively, ES cells or somatic cells that can 

15 reconstitute an organism ("somatic repopulating cells*") can be used as a source of nuclei for 
transplantation into an enucleated f^tilized oocyte giving rise to a transgenic mammal. See, 
Wihnut et al (1997) Nature 385: 810-813. 

EXAMPLES 

The following examples are offered to illustrate, but not to limit the present 

20 invention. 

Example 1 

The <I>C31 Recombination System Functions in Schizosaccharomvces vombe 

This Example demonstrates that the Streptomyces bacteriophage <[>C31 site- 
specific recombination system functions in eukaryotic cells. A bacteriophage attachment site 
25 {attP) was introduced into a chromosome of Schizosaccharomyces pombe at the S, pombe 
leul locus. This target strain was subsequently transformed with a plasmid that contains the 
bacterial attachment site (attB) linked to a ura4^ selectable marker. When co-transformed 
with a second plasmid harboring the OC31 integrase gene, high efficiency transformation to 
Ura+ was observed under conditions where the integrase gene was expressed. 
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Southern analysis of the integration events shows insertion of the attB'Ura4^ 
plasmid mto the attP site of the leul locus. Nucleotide sequence of the hybrid junctions 
revealed that the attB x attP recombination reaction is precise. 

Materials and Methods 

5 Recombinant DNA 

Standard methods were used throughout. E, coli strain XL2-Blue {recAl 
endAl gyrA96 thi-l hsdRl 7 supE44 relAl lac [F proAB lacB ZAMISlnlO (TetO Amy 
Cam^, Strategene) served as host for DNA constructs. 

Media 

1 0 Fission yeast strains were grown on minimal medium (EMM-low glucose, 

from BiolOl) supplemented as needed with 225 mg/1 adenine, histidine, leucine or uracil. 
Minimal plates with 5-FOA (5-floroorotic acid, from Zymo Research, Inc.) were prepared 
according to Grimm et al ((1988) Mol Gen. Genet 215: 81-86) and were supplemented 
with adenine, histidine, and leucine. When used, thiamine was added to 5 ^g/ml. 

15 S. pombe with <PC31 attP target 

The 84 bp OC31 attP site (abbreviated as PP*), isolated as anApal-Sacl 
fragment from pHS282 (Thoipe & Smith (1998) Proc. Natl Acad. Sci. USA 95:5505-5510) 
was cloned into the same sites of the S. pombe mtegrating vector pJK148 (Keeney & Boeke 
(1994) Genetics 1 36:849-856) to make pLT44. This plasmid was targeted to the S. pombe 

20 leul-32 allele by lithium acetate mediated transformation with Ndel cut DNA. The recipient 
host FY527 (h' ade6-M216 his3-Dl leul-32 ura4-D18\ converted to Leu* by homologous 
recombination with pLT44, was examined by Southem analysis. One Leu* transformant, 
designated FY527attP, was found to contain a single copy of pLT44. Another transformant, 
designated FY527attPx2, harbors a tandem plasmid insertion. 

25 Integrative nrz/C vector with <PC31 attB site 

The S, pombe ura4^ gene, excised from pTZura4 (S. Forsburg) on a 1.8 kb 
EcdBl'Bamm fragment, was inserted into pJK148 cut with the same enzymes to create 
pLT40. The <1>C31 attB site (abbreviated as BB'), isolated from pHS21 as a 500 bp BamHl- 
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Xbal fragment, was ligated into pLT40 cut with those enzymes, creating pLT42. Most of the 
leul gene was removed from pLT42 by deleting aXhol fragment to create pLT45. This 
removed all but 229 bp of leul from pLT45 and reduced its transformation eCBciency to that 
of a plasmid without any leul homology. pLT50, which has a second attB site in the same 
orientation immediately on the other side of ura4y was constructed by first subcloning the 
attB BammSacI fragment from pLT42 into pUC19, excising it with EcoRl and SalR, and 
subsequently insertmg it into pLT45 cut with EcoRL and Xhol. The second attB site in the 
final construct was sequenced once on each strand and foxmd to be identical to the first attB 
site. 

Linear DNA transformation 

The attB-ura4^-attB linear DNA was prepared as an AttU-AlwNl fragment 
purified from pLTSO, or as a PGR product using pLT50 as template. PGR was conducted 
using standard conditions with a T3 primer and a second primer (5* ggc cct gaa att gtt get tct 
gcc 3*) corresponding to the plasmid backbone of pJK148. 

Repressible synthesis of <PC31 integrase 

The S. pombe Pmnt promoter, repressible by vitamin B 1 , was excised as a 1 .2 
kb Pstl'Sacl figment firom pM0147 and inserted into the his3^, arsl vector pBG2 (Ohi et 
al (1996) Gene 174: 315-318) cut with the same enzymes, creating pLT41. A 2.0 kb iSatd 
Augment containing the <DG31 int coding region was transferred from pHS33 (Thoipe & 
Smith (1998) supra) to the Sad site of pLT41. A clone m which the int coding region is 
oriented such that expression is under the control of Pmnt was designated pLT43. 

Molecular analyses 

Southern analysis was perfomied using the Genitxs™ system from Boehringer 
Mannheim. A 998 bp internal EcoRV fragment of leul, a 1.8 kb firagment of ura4 , and the 
2.0 kb <I>C3 1 int gene were digoxigen-labeled by the random primer method and used as 
probes. Polymerase chain reaction was performed on a Perkin Ehner Getus Gene Amp PGR 
9600 using Stratagene Turbo PFU enzyme or VENT polymerase. The standard T3 and T7 
primers were used where possible. The ura4 primer (5' gtc aaa aag ttt cgt caa tat cac 3* 
(SEQ ID NO: 1)) and the pJK148 primers were purchased bom Operon Technologies. For 
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all PCR reactions an annealing temperature of 51^C and a 30-second extension time were 
used. 

Results and Discussion 

Inserting a target site into the S. pombe genome 
5 To create a host strain with a target site for OC3 1 -mediated integration, the 

<t>C31 attP site was inserted by homologous recombination into the leul locus of the fission 
yeast genome to form the Leu*^ strain FY527attP (Figure 1 A). Previous studies showed that 
when X pombe DNA is cleaved with Xbal and probed with an intemal 1 kb fragment of the 
leul* gene, the probe detects a 14 kb band (Keeney & Boeke (1994) Genetics 136: 849-856). 
1 0 Insertion of the leu'*' plasmid pJKl 48 at the leu 1 -32 locus resuhs in detection of 3 and 1 8 kb 
bands (Figure lA). Since pLT44 differs &om pJK148 by the inclusion of an 84 bp OC31 
attP element, integration of pLT44 at /ei/1-32 yielded the same 3 kb and 18 kb hybridization 
pattem in FY527a/rP. The absence of other hybridizing fragments indicates that the pLT44 
DNA resides as a single integrated copy. 

IS (PC31-integase-mediated transformation 

FY527atttP was transformed witii pLT45, which harbors ura^*" and an attB 
sequence (BB') but lacks an origin of rephcation. This construct was introduced by itself or 
with pLT43, a his3^ replicating vector that produces <I>C31 integrase. The inclusion of 
pLT43 increased the number of Ura^ transformants an average of 15 fold (Table 1). This 

20 enhancement cannot be attributed to the recombination between pLT45 and the replication- 
proficient pLT43, as its effect is dependent on integrase gene expression. Transcription of 
the integrase gene is under the control of Pmnty a promoter repressible by high levels of 
vitamin Bl (Maundrell, K. (1993) Gene 123: 127-130). The repression is not absolute 
(Forsburg, S. L. (l 993) Nucleic Acids Res 21: 2955-2956) but reduces the production of 

25 integrase protein. When thiamine was added to the growth medium, the number of Ura* 
transformants decreased to near background level. The frequency of Ura^ tranformants did 
not change significantly whether or not the integrase plasmid was co-selected by omission of 
histidine from the medium. The transformation competency of FY527atttP was estimated 
from the nimiber of His^ transformants obtained with pLT43 or its progenitor plasmid pBG2. 
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Compared to the frequency of either replicating plasmid, the pLT43-dependent 
transformation of FY527attP averaged about 1 5%. 

Table 1 : Integrase-dependent site-specific insertion in S, pombe FY527attP, 

5 



DNA 


Selection 


Bl 


Transfoimants 

per lO' cells 
(±sd)* 


Relative 
Value' 


Class a 


Class b 


Others 


pLT43 


His"" 




7200 (+2200) 


100 








pLT45 


Ura* 




63CtlO) 


1 


0%* 


0%* 


100%* 


pLT45 + 






1100 (tl20) 


15 


88%^ 




6%^ 


pLT43 














pLT45 + 


Ura^ 


+ 


120 (+16) 


2 


0%* 


25%* 


75%* 



pLT43 



*From three independent experiments 

^(transformation efiBciency of the DNA of interest)/(transformation efficiency of pLT43) x 
100 

10 ^n=16 

*n=8 

^31'integrase promoted attP jc attB recombination 

Recombination between the pLT45-encoded OC31 attB element and the 
chiomosomally situated attP sequence would incorporate the circular DNA into the leul 

15 locus as depicted in Figure IB. If this reaction occurs, AT^^jI-fractionated genomic DNA 
from the Ura^ transformants is probed with leul DNA, the 3 kb band will remain unchanged, 
while the 18 kb band will increase to -23 kb (Figure IC). Randomly selected Ura^ colonies 
were examined by hybridization analysis. Of eight isolates derived from experiments where 
4>C31 integrase gene expression was derepressed by the omission of thiamine, seven showed 

20 the presence of this -23 kb band. This same size band hybridized to the ura4 probe. This 
contrasts with the lack of Mni4 hybridization with the parental strain, as expected from its 
ttra4-Dl 8 deletion allele. One of these seven isolates showed additional bands hybridizing 
to both probes. This candidate appears to have a DNA rearrangement at the leul locus in 
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addition to a site-specific recombination event. The leul rearrangemmt was probably 
catalyzed by the operative 5. pombe homologous recombination system. The remaining 
isolate had not experienced a site-specific recombination event and speared to have gained 
uracil prototrophy by recombination between pLT45 and pLT43. Of these eight isolates, 
5 half were selected as both Ura^ and His^, but no significant diflFerence was found between 
this group and the group selected for Ura* only. 

From transformation experiments plated in the presence of vitamin Bl , an 
equal number of Ura^ transformants was examined by DNA hybridization. The thiamine- 
repressible Pnmt promoter is expected to limit integrase production, and thereby site-specific 

10 mtegration. Two of the eight Ura* candidates isolated firom this low fi-equency 

transformation showed a band of 23 kb hybridizing to leul and to the ura4 probe. However, 
since both probes detected an additional band, they do not represent correct integration 
events, and we grouped them as class b integrants. In the other six isolates, the hybridization 
patterns are difiBcult to interpret In some of them, the 3 kb band was not detected by the 

15 leul probe, as though the locus has experienced some rearrangement. In many of them, the 
weak hybridization to ura4 suggests that the Ura* phenotype may not be due to the stable 
maintenance of pLT45 in the genome. 

To ascertain the proportion of transformants maintaining the integrase 
plasmid in the absence of selection, the blots were re-probed with the integrase gene 

20 sequence. Those selected as Ura^ His^ would be expected to maintain the plasmid, and did 
so, as the hybridization revealed. Five of the eight isolates selected as Ura^ Mdthout regard to 
the His phenotype also gave bands hybridizmg to the integrase probe. To confirm that loss 
of int would not affect stable integration, another set of randomly chosen Ura"^ cells were 
grown non-selectively for a number of generations and screened for His" progeny that have 

25 lost pLT43. The analysis of eight representative Ura^ His* clones showed that all had a 
single copy of pLT45 precisely iategrated at the chromosome-situated attP site. The DNA 
of these integrants did not hybridize with the integrase probe. In contrast, the background 
ftequency Ura* clones derived by transformation of pLT45 alone gave the parental 
configuration of hybridizing bands at the leul locus and additional faint bands at 5 kb and 7 

30 kb. These observations are consistent vnfh either integration of pLT45 elsewhere in the 

28 



wo 01/07572 



PCT/USOO/19983 



genome, or maintenance of the plasmid in some cells despite the lack of a S, pombe 
replication origin. 

Conservative site-specific recombination 

PCR was used to retrieve the attPlattB recombinant junctions fiom three 
5 representative Ura^ candidates. One of the hybrid sites, attR (PB') would be flanked by T3 
and T7 promoters; the other site, attL (BP') by the T3 promoter and ura4 DNA (Figure IC). 
In each case, primer pairs directed to these sequences anq)lified a band of the expected size 
while the original attP (PP*) was no longer found. This contrasts with the parental strain 
FY527attP, where attP, but neither attL nor aft/?, was detected. The nucleotide sequence of 
1 0 three representative attL and attR PCR products showed the absence of accompanying 
mutations. Hence, as in bacteria and mammalian cells, 4C31 mediated site-specific 
recombination in S, pombe is a conservative recombination reaction. 

iX31 integrase does not excise integrated molecules 

Thorpe and Smith ((1998) Proc. Natl Acad. Set, USA 95: 5505-5510) did not 

1 5 detect reversal of the <1>C3 1 integrase reaction by analysis of gel-firactionated DNA 

fragments. We exaniuned the possibility of a reverse reaction through a genetic selection 
strategy. The precise integration of pLT45 into FY527attP was confirmed for three clones 
by Southern analysis; these strains were then re-transformed with pLT43. Excision of 
pLT45 would result in loss of the ura4^ marker, the Ura' phenotype can be scored on plates 

20 with 5-FOA (Grimm et al (1988) Mol Gen. Genet 215: 81-86). The frequencies of Ura* 
segregants from cultures of the three Ura^ His' progenitors were 5.7 x 10"^, 7.1 x 10"^ and 5.6 
X 10"*. In contrast, the frequencies of Ura' colonies from the three Ura^ His^ derivatives were 
somewhat higher: 1.1 x 10"^ 3.8 x 10'^ and 2.3 x 10'^, respective 19-, 5- and 4-fold increases. 
When a control vector lacking the integrase gene, pBG2, was used instead, increased rates 

25 of 5-FOA resistance were also found: 1.0 x 10 ^ 1.0 x 10 ^ and 8.0 x 10■^ respectively. The 
transformation process itself appears mutagenic. 

Three Ura" His^ clones fi:om each of the three cultures that had been 
transformed by pLT45 were analyzed by Southern blotting. One isolate had a DNA pattern 
consistent with stable mtegration of pLT45 into FY527attP. Therefore, in this clone, the 

30 Ura* phenotype was caused by a mutation that did not ^preciably alter the restriction 
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pattern, rather than by reversal of the site-specific recombination reaction. The second clone 
showed a Southern pattern characteristic of FY527attP lacking a pLT45 insertion, the third 
had a pattern consistent with a mixture of two types of cells, those like FY527attP without a 
pLT45 insertion, and those like the FY527attP progenitor strain FY527. The latter structure 
5 could arise from intrachromosomal homologous recombination betwem the leu repeats, 
reversing the insertion of pLT44 (Figure lA), If precise excision of the integrated plasmid 
DNA occurred in the latter two candidates, the attP site would be regenorated; this would be 
detectable with PGR. The size of the PGR product was that expected for an intact hybrid 
site, the presence of the hybrid site was confirmed by sequencing the PGR product. These 
1 0 observations are consistent with the idea that deletion of the uraA gene occurred by some 
mechanism other than OC31 -mediated excision. 

Summary 

The integration of a circular molecule at a single target site was an efficient 
process yielding precise insertions in nearly all transfonnants. The few aberrant events we 

1 5 observed are probably largely attributable to the S. pombe recombination system acting on 
the leul repetitive DNA. When integrase production was limited through the repression of 
its promoter, the number of transformants was reduced to near backgroimd level Under 
these conditions, few of the recovered transformants were derived fix)m OC31 site-specific 
recombination. Functional operation of the <1>C31 site-specific recombination system in 

20 eukaryotic cells presents new opportunities for the manipulation of transgenes and 

chromosomes. The OC3 1 system can be used with selective placement of attB and attP sites 
to delete, invert or insert DNA. An important feature of this system is that the attB x attP 
reaction is irreversible in the absence of an excision-specific protein. 

Example 2 

25 The OC31 Integrase Functions in CHO CeUs to Create Stable Integration 

This Example describes an experiment in which the OG3 1 integrase was 
tested for ability to mediate recombination between attB and attP recombination sites in 
Ghinese hamster ovary (GHO) cells. 
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Methods 

The CHO cell line 51 YT21 1 was transfected wifli the affP-containing plasmid 
pFYl , which included a selectable marker that confers zeocin resistance (Figure 2), After 
being single colony purified twice, six zeocin resistant cell lines were isolated. Analysis by 
Southern DNA hybridization confirmed that each of the six cell lines had at least one 
molecule of pFYl integrated into the genome. 

Each of the six cell Knes was transfected with the a/rff-containing plasmid 
pFY9 and the inr-containing plasmid pFY6 to test for site-specific recombination between 
the attB sites on pFY9 and the attP site on the chromosomal copy of pFYl. As control, the 
same cell lines were transfected with pFY9, but without the //ir-containing pFY6. The pFY9 
plasmid included a neomycin resistance selectable marker under the control of an SV40 
early promoter, as well as a green fluorescent protein (GFP) coding sequence that is not 
linked to a promoter. Site-specific recombination would thxis be expected to place the GFP 
coding sequence under the control of a human cytomegalovirus promoter that was included 
in pFYl , resulting in expression of GFP. 

Results 

Transfection results: Neomycin resistant colonies were placed under the 
microscope to observe whether the GFP gene is active. A large percentage of the cells 
transfected with pFY9+pFY6, but only a few of the cells transfected with the pFY9 alone 
showed GFP activity. This is consistent with site-specific integration of pFY9 when co- 
transfected with pFY6, and random insertion of pFY9 in the absence of a co-transfected int 
gene. 

PGR analysis was conducted using a primer set that corresponds to Pc 
(human cytomegalovirus promoter) and GFP (Figure 2). These primers would be expected 
to ampUiy a band of --0.6 kb corresponding to the integration junction. As neomycin 
resistant colonies could arise fi:om both site-specific integration and random integration, and 
that the GFP marker does not confer a selectable trait, it was diflScult to obtain pure cultures 
of integrant clones. Therefore, pools of neomycin resistant cells fix)m each transfected line 
were subjected to PGR analysis to examine if the integration junction were present among 
the neomycin resistant cells. A band of the expected size of --0.6 kb was obtained from two 
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lines. This indicates that the attB x attP recombination junction has fonned linking Pc with 
GFP. 

Example 3 

OC31 Integrase Catalyzes Site-Specific Recombination in CHO Cells 

5 This Example describes a second experiment in which the OC3 1 integrase 

was tested for ability to integrate a DNA molecule into the chromosome of Chinese hamster 
ovary (CHO) cells through the recombination between attP and attB sites. 

Methods 

Plasmid constructs 

10 Chromosomal attB target constructs pFY12. pFY14 and pFY15 

The plasmid pcDNA3. 1/His/lacZ (Invitrogen) was used as a vector backbone. 
A synthetic oligonucleotide contained different length of the attB site, flanked by HinSBl 
and Kpnl sites, was inserted between the HinSBl (AAGCTT) and Kpnl (GGTACC) sites of 
pcDNA3.1/HisAacZ. 

15 The plasmid pFY12 contains 90 bp of the attB sequence (AAGCTT 

gacggtctcg aagccgcggt gcgggtgcca gggcgtgccc ttgggctccc cgggcgcgta ctccacctca cccatctggt 

ccatcatgat GGTACC) (SEQ ID NO: 2). 

The plasmid pFY14 contained 50 bp of the attB site (AAGCTT gcgggtgcca 

gggcgtgccc ttgggctccc cgggcgcgta ctccacctca TGGTACC) (SEQ ID NO: 3). 
20 The plasmid pFYl 5 contained 30 bp of attB (AAGCTT ccagggcgtg 

cccttggjgct ccccgggcgc ATGGTACQ (SEQ ID NO: 4). 

Integrating attP plasmids pFY17. pFY19, pFY20 

The hpt gene encoding for resistance to hygromycin, obtained as a 1.6 kb 

BamBi to Kpnl fragment from pEDl 13, was inserted between tiie BamHl and A^nl sites of 
25 pBluescript n SK to generate the control plasmid pBSK-hpt. 

A synthetic oligonucleotide containing different lengths of the attP site was 

inserted between Sad (GTCGAC) and BarnHl (GGATCC) sites in pBSK-hpt to generate the 

following plasmids: 
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a) The plasmid pFY17 contains 90 bp (GAGCTC-g aagcggttt tcgggagtag- 
tgccccaact ggggtaacct ttgagttctc tcagttgggg gcgtagggtc gccgacatga cacaaggggt-GGATCQ of 
a^/Psite(SEQIDNO: 5). 

b) The plasmid pFyi9 contains 50 bp ofattP site fGAGCTC-t gccccaact 
5 ggggtaacct ttgagttctc tcagttgggg gcgtagggtc -GGATCQ (SEQ ID NO: 6). 

c) The plasmid pIT20 contains 32 bp of artP site (GAGCTC-actggggtaa 
cctttgagtt ctctcagt tg ggATCO (SEQ ID NO: 7) is called pFY20. 

Inteprase expressing construct pFY6 

An EcoRI to BamHl fragment containing the nearly complete open reading 
10 frame of the integrase gene was inserted between the EcoRI and BamHi sites of 

pcDNA3.1/Zeo(-) (Invitrogen). A synthetic oligonucleotide (GGGCCCGCCACGATGACA 
CAAGGGGTTGTGACCGGGGTGGACACGTACGCGGGTGCTTACGACCGTCAGTCG 
CGCGAGCGCGAGAATTC) (SEQ ID NO: 8) containing a Kozack sequence and the N- 
terminal amino acid coding sequences of the integrase gene was subsequently inserted 
15 between the Apal and EcoRI sites to reconstruct the open reading frame. This orientation 
places a complete integrase coding region under the control of the CMV (himnian 
cytomegalovirus) promoter in pcDNA3. l/Zeo(-), 

Transf action protocol 

The CHO cell Une K-1 was transfected with attB target constructs pFY12, 
20 pFY14 or pFY15 (Figure 3). These plasmids harbor the selectable marker for neomycin 
resistance, and an attB site of various lengths located between Pc (human cytomegalovirus 
promoter) and the lacZ coding region. Plasmids pFY12, pFY14 and pFY15 contain, 
respectively, 90, 50 and 30 bp of the attB sequence. Neomycin-resistant cell lines were 
obtained from consecutive purification of single colonies. Four lines of each construct were 
25 used for integration experiments. 

Each of the 12 lines was transfected with pFY6, a <bCZ I integrase expression 
plasmid, along with an integration vector, pFY17, pFY19, orpFY20. The plasmids pFY17, 
pFY19 and pFY20 harbor an attP sequence of lengths 90, 50 and 32 bp, respectively. The 
attP sequence is situated iq)stream of the hpt open reading fi^e, which encodes 
30 hygromycin phosphotransferase, an enzyme that confers resistance to hygromycin. There is 
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no promoter upstream of the attP-hpt segment and hpt is therefore not expressed unless the 
plasmid integrates into the genome in such a way that the hpt coding region fuses with a 
genomic promoter. For control, pBSK-hpt was used to monitor the frequency of promoter 
fusion to hpt. The plasmid pBSK-hpt is identical to pFY17, pFY19, and pFY20 ^cept it 
5 lacks an attP sequence. The recombination between attP and attB sites is expected to insert 
the integration vector into the chromosome target to gen^ate a Pc-attL-hpt linkage. 
Expression of hpt will confer resistance to hygromycin. 

Results 

Transfection results: Hygromycin resistant colonies were scored for each 
10 integration plasmid which was transfected into the 12 cell lines (Table 2). From 1x10^ cells 
plated, pBSK-hpt transfections failed to produce a significant number of resistant colonies. 
This indicates that the frequency of &e hpt coding region fusing to a genomic promoter is 
extremely low. fii contrast, pFY17, pFY19 and pFY20 yielded up to a thousand fold higher 
number of hygromycin resistant colonies, depending on the particular integration plasmid 
1 5 and the particular cell line. Higher numbers of hygromycin resistant colonies were produced 
from the transfection of pFY19 or pFY17 into FY12 lines. This indicates that the 
recombiaation between longer attB and attP sequences is more eflBcient than the 
recombination between shorter attB and attP sites. 

PCR was used to detect the expected --0.8 Kb junction band from 
20 representative colonies. Primers corresponding to the human cytomegalovirus promoter and 
the hpt coding region amplified a PCR product of the expected size (0.8 kb). This indicates 
that Pc is linked to the hpt coding region, consistent with recombination between the 
gmomic attB site and the plasmid attP sequence. 

Example 4 

25 The OC31 Integrase Functions in Plant Chromosomes to Recombine attP and attB Sites 
This example describes an experiment in which the <I>C3 1 integrase was 
tested for ability to recombiae attP and attB sites that are present in a plant chromosome. 
The constructs and strategy for this experiment are shown in Figure 4. 



30 



34 



wo 01/07572 



PCT/USOO/19983 



Table! 

Number of hygromycin resistant colonies per 1x10^ transfected cells. 





Integration plasmids 


Taiget cell lines 


pFY17 


dFY19 
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(attB-90) 










FY12-1 


ZDO 




275 


0 


FY12-2 


976 


896 


185 


0 


FY12-3 
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1 1 j| 
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FY12-4 
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FY14-3 
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FY14-7 


96 


245 


67 


0 


FY14-8 
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FY14-9 


89 


255 


78 
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FY15-1 


0 


24 


0 


0 


FY15-2 


0 


345 


34 


0 


FY15-3 


55 


455 


23 


1 


FY15-4 


0 


0 


0 


0 
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Methods 

The construct pWP29 contains the fiagment consisting of SSS^ttP-npt-attB- 
gus, flanked by RB and LB, where 555 is the cauliflower mosaic virus promoter, npt is the 
coding region for neomycin phosphotransferase, and gus is the coding region for 
glucuronidase. RB and LB are the right and left Agrobacteriimi T-DNA border sequences, 
respectively. The attP site between 35S and npt serves as a non-translated leader sequence. 
Transcription of npt by 35S confers resistance to kanamycin. The gus coding region is not 
transcribed due to the lack of an upstream promoter. 

A second construct used for plant transformation is pWP24. This construct 
contains the fragment Pnos-npt-SSS-int^ flanked by RB and LB, wh^e Pnos is the nopaline 
synthase promoter, and int is the OC3 1 integrase coding region. Both npt and int are 
transcribed from their respective iq)stream promoters. 



35 



10 



15 



wo 01/07572 



PCT/USOO/19983 



If the two constructs were present in the same genome, the expression of int 
from the pWP24 bearing chromosome would be expected to produce functional OC31 
integrase to catalyze the recombination between attB and attP sites situated on the pWP29- 
bearing chromosome. The recombination event would be expected to delete the npt gene 
5 from the pWP29 construct and fuse 35S to gus. The resulting configuration would be 35S- 
attR'gus, where ottR is a hybrid site formed by the recombination between atiP and attB, 
also designated as PB* (Figure 4). The deletion of npt brings gus under the transcription of 
35 S and would be expected to yield plants with GUS enayme activity. This activity can be 
detected through histochemical staining of the plant tissue. 

10 Results 

A transient expression assay was conducted to determine whether pWP29 
was functional for recombmation. Through the biolistics-mediated delivery of naked DNA, 
pWP29 was cointroduced with pWP8 into maize BMS cells. The construct pWP8 has the 
integrase gene fused behind the maize ubiquitin promoter for expression in monocot cells. 

1 5 Blue spots were observed when both plasmids were co-introduced, but were not found if 
only one of the plasmids was used. This indicated that site-specific recombination took 
place in maize cells and that the attP and attB sites in pWP29 were functional sites. 

Kanamycin resistant tobacco plants were regenerated by Agrobacterium- 
mediated transformation using pWP29 or pWP24. Another transient expression assay was 

20 conducted to determine whether the pWP24 lines produced functional integrase. The 

construct pWP29 was introduced into the pWP24 plants through biolistics mediated delivery 
of naked DNA Cells that take up the pWP29 DNA would be expected to express GUS 
enzyme activity as a result of the formation of a 35S-attR'gus configuration. Indeed, two 
lines, 24.3 and 24.4 yielded blue spots consistent of functional integrase-mediated site- 

25 specific recombination between the attP and attB sites. 

These two pWP24 integrase lines were crossed to pWP29 tester lines to 
produce progeny with the chromosomes carrying pWP29 and pWP24 m the same genome. 
Table 3 summarizes the results fi*om the genetic crosses between integrase (24,3, 24.4) and 
tester lines (29.2, 29.4,29.5, 29.19). In each case, representative progeny seedlings were 

30 germinated in the absence of selection and histochemically stained for GUS enzyme activity. 
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The table lists the number of progeny that stained blue. As the primary transformed pWP24 
and pWP29 lines are hemizygous for their respective transgene, only a quarter of the 
progeny would be expected to carry both transgene types. The sample sizes were small, so 
an apparent deviation from the expected frequency is not unusual. 

5 

Table 3: Progeny that showed gus expression from histochemical staining. 



Male Donor plant Female Recipient 


Number of 


Number of 


% positive for 


line 


plant line 


progeny stained 


progeny that 


gus activity 






for gus activity 


show gus 










activity. 




24.3+ 


29.2 


38 


11 


29% 


29.2 


24.3+ 


38 


1 


2.6% 




29.4 


24.3+ 


18 


3 


16% 




24.3+ 


29.5 


38 


4 


10% 


29.5 


24.3+ 


26 


0 


0 




24.4 


29.2 


38 


7 


19% 


29.2 


24.4+ 


38 


7 


19% 




29.4 


24.4+ 


19 


8 


42% 




24.4+ 


29.5 


38 


17 


45% 


29.5 


24.4+ 


20 


6 


30% 




29.19 


24.4+ 


18 


7 


39% 



The intensity of staining varied depending on the combination of lines used as 
1 0 parental lines. Those with progeny with a greater proportion of the tissue staining blue 

indicate that the recombination event was more eflScient. Conversely, those yielding progeny 
with less uniform staining indicate that the recombination event was less eflScient. This 
variation among the different progeny pools is probably due to effects caused by the position 
of integration of the transgenes. Of the two integrase lines, 24.4 appears more efficient in 
15 promoting site-specific recombination. This is probably due to a higher level of int gene 

expression. Staining patterns produced by crossing 24.4 to 29.4 and 29.19 are consistent with 
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the experimental design that int promoted site-specific recombination of attB and attP results 
in the activation of gus gene activity. 

Example 5 

OC31 Integrase Catalyzes Integration of a Circnlar Plasmid into a Plant Chromosome 

5 This example describes an experiment in which the <t>C3 1 integrase was 

tested for ability to insert a circular plasmid molecule into the plant chromosome through 
attP X attB site-specific recombination. This experiment is diagrammed in Figure 5. 

Methods 

The target construct pWP6 contains the firagment consisting of 3 SS-attP-npt, 
1 0 flanked by RB and LB. The attP site between 35S and npt serves as a non-translated leader 
sequence. Transcription of npt by 35 S confers resistance to kanamycin. 

The integrating construct pYJC43 has the Augment attB-hpt, where hpt codes 
for resistance to hygromycm. The integrase expression construct is pYJC41, in which 35S 
transcribes int. 

1 5 The target construct pWP6 was placed into a plant chromosome through 

random integration of pWP6 DNA. Kanamycin resistant plants harboring a single copy of 
the pWP6 transgene are then subsequently transforaied with pYJC43 and pYJC41 , The 
transient expression of int 6om pYJC41 was expected to catalyze the recombination 
between the attB site of pYJC43 and the chromosomally-situated attP site of the pWP6 

20 transgene. The specific recombination between attB and attP sites would insert the pYJC43 
circular molecule into the chromosome to generate a SSS-attL-hpt linkage. Note that 
because the attP and sites are depicted in the inverted orientation, the ottL site will 
likewise be in an inverted orientation, or designated P*B, the same as BP' in the drawn in an 
inverted orientation. A fimctional 35S'attL'hpt linkage would confer a hygromycin 

25 resistance phenotype. 

Results 

Kanamycin resistant tobacco plants harboring pWP6 were obtained through 
Agrobacterium-mediated transformation. Southern hybridization analysis detected one line 
that harbors a single copy of the pWP6 transgene. Progeny firom this line, WP6. 1, were 
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germinated asq)tically and protoplasts were made from these plants. The protoplasts were 
transformed by the combination of pYJC43 and pYJC41 DNA by the polyethylene glycol 
method for direct DNA transformation. The protoplasts were then imbedded into agarose 
and cultured to form calli in the presence of hygromycin. The rate of callus formation in the 
5 absence of hygromycin selection was 4 x 10"^, This is about 10 fold lower than usual, but is 
within the range of variabiUty observed in protoplast transformation experiments. In the 
presence of hygromycin selection, the rate of callus formation was 7 x lO'^. This indicates 
that about 18% of the calli that regenerated from protoplasts contained the integration vector 
at the target site. When the integrase construct pYJC41 was excluded from the 
10 transformation, the rate of callus formation was <1 x 10"^. The higher frequency of 
hygromycin resistant calli produced by inclusion of the integrase expressing plasmid 
pYJC41 is consistent with the integrase promoted site-specific integration of pYJC43 into 
the chromosomal attP target. 

IS It is understood that the examples and embodiments described herein are for 

illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
appUcations cited herein are hereby incorporated by reference for all purposes. 
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WHAT IS CLAIMED IS : 

1 LA eukaryotic cell that comprises a prokaryotic recombinase polypeptide 

2 or a nucleic acid that encodes a prokaryotic recombinase, wherein the recombinase can 

3 mediate site-specific recombination between a first recombination site and a second 

4 recombination site that can serve as a substrate for recombination with the first 

5 recombination site, but in the absence of an additional factor that is not present in the 

6 eukaryotic cell cannot mediate recombination between two hybrid recombinase 

7 recombination sites that are formed upon recombination between the first recombination site 

8 and the second recombination site. 

1 2. The eukaryotic cell of claim 1 , wherein the recombinase is selected 

2 &om the group consisting of a bacteriophage OC31 integrase, a coliphage P4 recombmase, a 

3 Listeria phage r^ombinase, a bacteriophage R4 Sre recombinase, a Cis A recombinase, an 

4 XisF recombinase, and a transposon Tn4451 TnpX recombinase. 

1 3. The eukaryotic cell of claim 1 , wherein the recombinase is a 

2 bacteriophage OC31 integrase. 

1 4. The eukaryotic cell of claim 1, wherein the first recombination site is an 

2 attB site and the second recombination site is an attP site. 

1 5. The eukaryotic cell of claim 1 , wherein the cell fiirther comprises a first 

2 recombinase recombination site. 

1 6. The eukaryotic cell of claim 1 , wherein the cell comprises a nucleic acid 

2 that comprises a coding sequence for an recombinase polypeptide, which coding sequence is 

3 operably linked to a promoter that mediates expression of the recombinase-encoding 

4 polynucleotide in the eukaryotic cell. 

1 7. The eukaryotic cell of claim 6, wherein the nucleic acid fiuther 

2 comprises a selectable marker. 
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1 8. The eukaryotic cell of claim 6, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 9, The eukaryotic cell of claim 8, wherein the nucleic acid is the plasmid 

2 pLT43. 

1 10. The eukaryotic cell of claim 1 , wherein the eukaryotic cell is selected 

2 from the group consisting of an animal cell, a plant cell, a yeast cell, an insect cell and a 

3 fungal cell. 

1 1 L The eukaryotic cell of claim 1 0, wherein the eukaryotic cell is a 

2 mammalian cell. 

1 12. The eukaryotic cell of claim 10, wherein the eukaryotic cell is present in 

2 a multicellular organism. 

1 13. A method for obtaining site-specific recombination in a eukaryotic cell, 

2 the method comprising: 

3 providing a eukaryotic cell that comprises a first recombiuation site and 

4 a second recombination site, which second recombination site can save as a substrate for 

5 recombination with the first recombination site; 

6 contacting the first and the second recombination sites with a 

7 prokaryotic recombinase polypeptide, resulting in recombination between the recombination 

8 sites, thereby forming one or two hybrid recombination sites; 

9 wherein fhe recombinase polypeptide can mediate site-specific 

10 recombination between the first and second recombination sites, but cannot mediate 

1 1 recombination between two hybrid recombination sites in the absence of an additional factor 

12 that is not present in the eukaryotic cell. 

1 14. The method of claim 13, wherein the eukaryotic cell is selected from the 

2 group consisting of a yeast cell, a fungal cell, a plant cell, an insect cell and an animal cell. 
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1 15. The method of claim 13, wherein the first recombination site is present 

2 in a chronaosome of the eukaryotic cell. 

1 16. The method of claim 1 5, wherein the second recombination site is 

2 present in a second chromosome of the eukaryotic cell and contacting the first and second 

3 recombination sites with the recombinase results in translocation of chromosome aims. 

1 1 7. The method of claim 1 3, wherein the first recombination site and the 

2 second recombination site are present on a single nucleic acid molecule. 

1 18. The method of claim 1 7, wherein the first recombination site and the 

2 second recombination site are in a direct orientation. 

1 19. The method of claim 1 8, wherein the recombination results in excision 

2 of the portion of the nucleic acid molecule that lies between the first and second 

3 recombination sites, 

1 20. The method of claim 17, wherein the first recombination site and the 

2 second recombination site are in an inverted orientation. 

1 21. The method of claim 20, wherein the recombination results in inversion 

2 of the portion of the nucleic acid molecule that lies between the first and second 

3 recombination sites. 

1 22. The method of claim 13, wherein the eukaryotic cell comprises a 

2 polynucleotide that encodes the recombinase polypeptide. 

1 23. The method of claim 22, wherein the recombinase-encoding 

2 polynucleotide is operably linked to a promoter which mediates expression of the 

3 polynucleotide in the eukaryotic cell. 
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1 24. The method of claim 23, wherein the promoter is an inducible or a 

2 repressible promoter. 

1 25. The method of claim 24, wherein the promoter is a Pmnt promoter. 

1 26. A method for obtaining a eukaryotic cell having a stably integrated 

2 transgene, the method comprising: 

3 introducing a nucleic acid into a eukaryotic cell that comprises a first 

4 recombination site, wherein the nucleic acid comprises a transgene and a second 

5 recombination site which can serve as a substrate for recombination with the first 

6 recombination site; and 

7 contacting the first and the second recombination sites with a 

8 prokaryotic recombinase polypeptide, wherein the recombinase polypeptide catalyzes 

9 recombination betwem the first and second recombination sites, resulting in integration of 

10 the nucleic acid at the first recombination site, thereby forming a hybrid recombination site 

11 at each end of the nucleic acid; 

12 wherein the recombinase polypeptide can mediate site-specific 

13 recombination between the first and second recombination sites, but cannot mediate 

14 recombination between two hybrid recombination sites in the absence of an additional factor 

15 that is not present in the eukaryotic cell. 

1 27. The method of claim 26, wherein the recombinase polypeptide is 

2 selected fiom the group consisting of a bacteriophage OC3 1 integrase, a coliphage P4 

3 recombinase, a Listeria phage recombinase, a bacteriophage R4 Sre recombinase, a CisA 

4 recombinase, an XisF recombinase, and a transposon Tn4451 TnpX recombinase. 

1 28. The method of claim 27, wherein the recombinase is a <I)C3 1 integrase. 

1 29. The method of claim 26, wherein the recombinase polypeptide is 

2 introduced into the eukaryotic cell by expression of a polynucleotide that encodes the 

3 recombinase polypeptide. 
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1 30. The method of claim 29, wherein the polynucleotide that encodes the 

2 recombinase polypeptide is operably linked to a promoter that functions in the eukaryotic 

3 cell. 

1 31. The method of claim 30, wherem the promoter is an inducible or a 

2 repressible promoter, 

1 32. A nucleic acid that comprises a polynucleotide sequence that encodes a 



2 bacterial recombinase polypq)tide operably linked to a promoter that functions in a 

3 eukaryotic cell, wherein the recombinase polypeptide cannot mediate recombination between 

4 two hybrid recombination sites that are formed upon recombination between a first 

5 recombination site and a second recombination site in the absence of an additional factor. 



1 33. The nucleic acid of claim 32, wherein the nucleic acid further comprises 

2 at least one recombination site that is recognized by the recombinase polypeptide, 

1 34. The nucleic acid of claim 32, wherein the nucleic acid comprises a 

2 plasmid vector. 

1 35, The nucleic acid of claim 34, wherein the vector is pLT43, 

1 36. A eukaryotic cell that comprises a polynucleotide that comprises a first 

2 bacteriophage OC3 1 recombination site. 

1 37. The eukaryotic cell of claim 36, wherein the recombination site is 

2 selected firom the group consisting of attP and attB. 

1 38. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a second polynucleotide that conq)rises a second OC31 recombination site that 

3 undergoes recombination with the first OC31 recombination site when contacted with a 

4 OC3 1 integrase polypeptide. 
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1 39. The eukaryotic ceU of claim 38, wherein: 

2 the first recombination site is attB and the second recombination site is 

3 attP; or 

4 the first recombination site is attP and the second recombination site is 

5 attB. 

1 40. The eukaryotic cell of claim 38, wherein the second polynucleotide 

2 further comprises a transgene. 

1 41 . The eukaryotic cell of claim 38, wherein the second polynucleotide 

2 further comprises a selectable marker. 

1 42. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a OC3 1 integrase polypeptide. 

1 43. The eukaryotic cell of claim 36, wherein the eukaryotic cell further 

2 comprises a nucleic acid that comprises a polynucleotide that encodes a OC3 1 integrase 

3 polypeptide. 

1 44. The eukaryotic cell of claim 43, wherein the nucleic acid further 

2 comprises a selectable marker. 

1 45. The eukaryotic cell of claim 43, wh^ein the nucleic acid further 

2 comprises a promoter which results in expression of the OC3 1 integrase-encoding 

3 polynucleotide in the cell. 

1 46. The eukaryotic cell ofclaim 45, wherein the promoter is an inducible 

2 promoter. 

1 47. The eukaryotic cell of claim 36, wherein the eukaryotic cell is selected 

2 6om the group consisting of a yeast cell, a fimgal cell, a plant cell, and an animal cell. 
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Figure 2 

Transgene Integration in CHO Cell Line 
Site Specific Expression of GFP 
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Figure 3 

Transgene Integration in CHO Cell Line 
Hygromycin Resistance from atiB x attP recombination 
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Figure 4 

Excision of DNA from Tobacco Genome 
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Figure 5 

Integration of DNA into the tobacco genome 
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